REPORT  DOCUMENTATION  PAGE 

Form  Approved 

OMB  No.  0704-0188 

The  public  reporting  burden  for  this  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information,  including 
suggestions  for  reducing  this  burden,  to  Department  of  Defense,  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports  (0704-0188),  1215  Jefferson  Davis  Highway, 
Suite  1204,  Arlington,  VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  any  penalty  for  failing  to  comply  with  a  collection  of 
information  if  it  does  not  display  a  currently  valid  OMB  control  number. 

PLEASE  DO  NOT  RETURN  YOUR  FORM  TO  THE  ABOVE  ADDRESS. 

1.  REPORT  DATE  (DD-MM-YYYY) 

NOVEMBER  2015 

2.  REPORT  TYPE 

JOURNAL  ARTICLE  (POST  PRINT) 

3.  DATES  COVERED  (From  -  To) 

JUN  2013 -AUG  2015 

4.  TITLE  AND  SUBTITLE 

ON  THE  EXISTENCE  OF  T-IDENTIFYING  CODES  IN  UNDIRECTED 

DE  BRUIJN  NETWORKS 

5a.  CONTRACT  NUMBER 

NA 

5b.  GRANT  NUMBER 

NA 

5c.  PROGRAM  ELEMENT  NUMBER 

62788F 

6.  AUTHOR(S) 

Victoria  Horan 

5d.  PROJECT  NUMBER 

HORN 

5e.  TASK  NUMBER 

IN 

5f.  WORK  UNIT  NUMBER 

HO 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Air  Force  Research  Laboratory/Information  Directorate 

Rome  Research  Site/RITF 

525  Brooks  Road 

Rome  NY  13441-4505 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

N/A 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

Air  Force  Research  Laboratory/Information  Directorate 

Rome  Research  Site/RITF 

525  Brooks  Road 

Rome  NY  13441-4505 

10.  SPONSOR/MONITOR’S  ACRONYM(S) 

AFRL/RI 

11.  SPONSORING/MONITORING 

AGENCY  REPORT  NUMBER 

AFRL-RI-RS-TP-201 5-008 

12.  DISTRIBUTION  AVAILABILITY  STATEMENT 

APPROVED  FOR  PUBLIC  RELEASE;  DISTRIBUTION  UNLIMITED.  PA  Case  Number:  88ABW-20 15-3796 

DATE  CLEARED:  4  AUG  15 

13.  SUPPLEMENTARY  NOTES 

One  or  more  of  the  authors  is  a  U.S.  Government  employee  working  within  the  scope  of  their  Government  job;  therefore, 
the  U.S.  Government  is  joint  owner  of  the  work  and  has  the  right  to  copy,  distribute,  and  use  the  work.  All  other  rights  are 
reserved  by  the  copyright  owner. 

14.  ABSTRACT 

This  paper  proves  the  existence  oft-identifying  codes  on  the  class  of  undirected  de  Bruijn  graphs  with  string  length  n  and 
alphabet  size  d,  referred  to  as  B(d,  n).  It  is  shown  that  B(d,  n)  is  t-identifiable  whenever  d  >  3  and  n  >  2t,  and  t  >  1 ,  or  d  > 
3,  n  >  3,  and  t  =  2,  or  d  =  2,  n  >  3,  and  t  =  1 .  The  remaining  cases  remain  open.  Additionally,  we  show  that  the 
eccentricity  of  the  undirected  non-binary  de  Bruijn  graph  is  n. 


15.  SUBJECT  TERMS 

Identifying  Code;  De  Bruijn  Network;  Graph  Theory 


16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

18.  NUMBER 

OF  PAGES 

19a.  NAME  OF  RESPONSIBLE  PERSON 

VICTORIA  HORAN 

a.  REPORT 

U 

b.  ABSTRACT 

U 

c.  THIS  PAGE 

U 

UU 

18 

19b.  TELEPHONE  NUMBER  ( Include  area  code) 

N/A 

Standard  Form  298  (Rev.  8-98) 


Prescribed  by  ANSI  Std.  Z39.18 


arXiv:1508.00403vl  [math. CO]  3  Aug  2015 


On  the  Existence  of  ^-Identifying  Codes  in 
Undirected  De  Bruijn  Graphs 

Victoria  Horan  * 

Air  Force  Research  Laboratory 
Information  Directorate 

August  4,  2015 


i 


Abstract 

This  paper  proves  the  existence  of  f-identifying  codes  on  the  class 
of  undirected  de  Bruijn  graphs  with  string  length  n  and  alphabet  size 
d,  referred  to  as  B(d,n).  It  is  shown  that  B(d,n)  is  i-identifiable 
whenever: 

•  d  >  3  and  n  >  2 1,  and  t  >  1. 

•  d  >  3,  n  >  3,  and  t  =  2. 

•  d  =  2,  n  >  3,  and  t  =  1. 

The  remaining  cases  remain  open.  Additionally,  we  show  that  the 
eccentricity  of  the  undirected  non-binary  de  Bruijn  graph  is  n. 


1  Introduction  and  Background 

Let  x  G  F(G),  and  define  the  ball  of  radius  t  to  be  the  set  of  all  vertices  y 
with  d(x,  y )  <  t.  The  formal  definition  of  a  f-identifying  code  on  a  graph  G 
is  as  follows. 

*victoria.horan . l@us . af .mil 
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Definition  1.1.  A  subset  S  C  V{G)  is  a  ^-identifying  code  in  a  graph  G  if 
the  following  conditions  are  met. 

1.  For  all  x  e  V(G),  Bt(x)  D  S  ±  0. 

2.  For  all  x,y  E  V (G)  with  x  y,  we  must  have  5t(a:)  fl  S  Bt(y)  D  S. 

The  first  condition  in  the  definition  requires  that  S'  be  a  dominating  set. 
The  second  condition  requires  that  each  vertex’s  identifying  set  (the  sets 
Bt(x)  fl  S  and  Bfiy )  fl  S)  is  unique.  To  settle  the  question  of  existence  of 
^-identifying  codes  in  a  graph,  we  will  rely  on  the  following  fact.  If  a  t- 
identifying  code  exists  in  a  graph  G,  then  we  say  that  G  is  ^identifiable.  If 
the  variable  t  is  omitted,  then  we  may  assume  that  t  —  1. 

Definition  1.2.  Two  vertices  x,y  are  t- twins  if  Bt  (x)  =  Bt(y). 

Fact  1.3.  A  graph  is  t -identifiable  if  and  only  if  it  does  not  contain  any 
t  -twins. 

Next  we  will  define  the  class  of  de  Bruijn  graphs.  A  good  reference  for 
the  de  Bruijn  graphs  and  some  of  their  properties  is  [T].  First,  we  define 
[d]  =  {0, 1,  2, . . . ,  d  —  1}  (note  that  this  definition  is  non-standard).  Then  we 
define  the  de  Bruijn  graph  as  follows. 

Definition  1.4.  Define  the  set  S(d,  n)  to  be  the  set  of  all  strings  of  length  n 
over  the  alphabet  [d\.  The  directed  de  Bruijn  graph  B(d,  n )  is  the  graph  with 
vertex  set  V  =  S(d,n),  and  edge  set  E  =  S(d,  n  +  1).  An  edge  X\X2  ■  ■  ■  xn+\ 
denotes  the  edge  from  vertex  X1X2  . . .  xn  to  vertex  X2X%  . . .  xn+\.  The  undi¬ 
rected  de  Bruijn  graph  B(d,n )  is  B(d,n )  with  undirected  edges. 

Identifying  codes  were  first  introduced  and  defined  in  [3] .  They  have  many 
interesting  applications,  such  as  efficiently  placing  smoke  detectors  in  a  house 
to  provide  maximum  location  information.  They  are  related  to  (but  different 
from)  dominating  sets,  perfect  dominating  sets,  locating  dominating  sets, 
and  many  more  types  of  vertex  subsets.  In  general,  the  problem  of  finding 
an  identifying  code  in  a  graph  is  an  NP-complete  problem  [3].  Results  on 
the  existence  and  construction  of  identifying  codes  on  the  directed  de  Bruijn 
graph  can  be  found  in  [3]. 

In  Section  [2]  we  prove  results  on  existence  in  B(d,n)  for  d  >  2,  while 
Section  [3]  considers  the  case  when  d  —  2  separately.  Finally,  we  conclude 
with  some  open  problems. 
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2  Non-Binary  de  Bruijn  Graphs 

We  begin  with  our  main  result. 

Theorem  2.1.  B(d,n)  is  t -identifiable  for  d  >  3  and  n  >  2 1. 

To  prove  this  theorem,  we  will  use  several  lemmas  that  we  prove  first. 
The  first  lemma  (from  [2])  is  stated  using  our  own  terminology  and  with  our 
own  discussion/proof. 

Lemma  2.2.  The  strings  in  Bt(x)  for  x  =  x i x-2  . . .  xn  must  be  in  one  of  the 
following  three  sets. 

1.  {x}/ 

2.  [d]9  ©  xfe_/+i . . .  xn_/  ©  \d]b~9  with  b  >  /,  b  >  g,  f  +  b  +  g  <  t; 

3.  [d]f~c  ©  xb+i . . .  xn_f+b  ©  [d]c  with  f  >  b,  f  >  c,b  +  f  +  c  <  t. 

Proof.  All  strings  in  Bt  (x)  can  be  described  by  following  forward  or  backward 
edges.  The  strings  of  type  (1)  are  reached  by  taking  no  moves.  All  other 
strings  (types  (2)  and  (3))  are  reached  by  taking  either  moves  of  type  FBF 
(forward-backward-forward)  or  BFB  (backward-forward-backward).  We  will 
describe  shortest  paths  within  these  confines.  We  define  /  steps  forward  from 
vertex  X\X2  ■  ■  ■  xn  as  reaching  vertices  in  the  set: 

[d\f  ©  xi . .  .xn-f. 

We  define  b  steps  backward  from  vertex  X1X2  ■  ■  ■  xn  as  reaching  vertices  in 
the  set  : 

xb+1 . .  ,xn  ©  [d}b. 

If  FBF  is  the  shortest  path  to  reach  some  vertex  y  from  x,  then  we  must 
follow  /  edges  forward,  b  edges  backward,  and  g  edges  forward,  with  the 
constraints  that  b  >  f ,  b  >  g,  and  /  +  b  +  g  <  t.  Following  these  sequences, 
we  arrive  at  strings  of  type  (2). 

If  BFB  is  the  shortest  path  to  reach  some  vertex  y  from  x,  then  we  must 
follow  b  edges  backward,  /  edges  forward,  and  c  edges  backward,  with  the 
constraints  that  f  >  b,  f  >  c,  and  b  +  f  +  c  <  t.  Following  these  sequences, 
we  arrive  at  strings  of  type  (3).  □ 
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Next,  we  will  look  at  the  possible  ©prefixes  that  can  appear  in  a  spe¬ 
cial  subset  of  Bt(y).  A  ©prefix  of  a  string  xix2--.xn  is  simply  the  first 
t  letters:  x\x2  ...xt.  Since  [d]1  ©  y±y2  ■  ■  ■  yn-t  Q  Bt(y),  if  we  consider  the 
whole  set  Bt(y)  then  every  possible  ©prefix  must  appear.  Instead,  we  want 
to  determine  an  upper-bound  on  the  number  of  distinct  ©prefixes  in  Bt(y )  \ 
( [d] *  ©  yiy2  .  .  .  yn-t )■  Eventually,  we  will  show  that  this  number  of  ©prefixes 
is  smaller  than  dt,  so  we  will  always  be  able  to  choose  a  ©prefix  outside  of 
this  special  subset. 

Lemma  2.3.  For  n  >  2 t,  the  number  of  distinct  t-prefixes  in  Bt(y )  \  [ d ]*  © 
2/12/2  •  •  •  Vn-t  is  at  most 


t- i 


1  -  dlt/2i  +  2  ■ 


j= o 


Proof.  Following  Lemma 12721  the  ©prefixes  in  Bt{y)  take  one  of  the  following 
three  forms  (matching  the  types  in  Lemma  [2.211. 

i-  vm ■■■  Vt\ 

2.  [d]9  ©  2/6-/-I-1  •  •  •  Vt+b—f—g] 

3.  [c^]/_c  ©  2/b+i . . .  2/i+6+c-/- 

In  order  to  more  easily  count  these  t-prefixes,  we  will  sort  them  by  the 
last  letter  that  appears  in  the  ©prefix,  and  then  sort  them  from  longest  [d\l 
prefix  to  smallest.  Since  the  largest  [d]1  prefix  also  counts  the  strings  with 
smaller  [ d ]J  prefix  so  long  as  the  strings  end  in  the  same  letter,  this  will  allow 
us  to  count  unique  prefixes.  We  begin  by  rewriting  the  types  of  prefixes  so 
as  to  more  easily  do  this. 

1-  2/12/2  •••  yt] 

2.  Recall  the  initial  requirements  for  b,  f,g  from  Lemma [2.21  We  find  the 
range  of  y-subsequences  by  noticing  that  b>g  +  l,  f<t  —  b  —  g  < 
t  —  2g  —  1,  and  also  that  b  —  f  is  maximized  whenever  /  =  0.  If  /  =  0, 
then  we  have  either  b  =  t  —  g,  or  if  g  is  large  enough  (i.e.  g  =  (t  —  1) /2) 
we  have  b  —  g  +  1.  Combined,  this  gives  us  the  following  equations. 


min (6  -  /)  =  (g  +  1)  -  (t  -  2g  -  1) 

=  3g  +  2  —  t,  and 
max(6  —  /)  =  rna x(g  +  1,  t  —  g) 

=  t-g. 
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Hence  for  0  <  g  <  L©-: 

[d]9  ©  y3g+2-t+l  ■  ■  ■  V2g+2 

W  ©  yt  -g+ 1  •  •  •  ])2t—2g 

Now  we  consider  all  of  the  possible  last  letters  that  might  appear. 

Last  letters:  yt  such  that  2g  +  2  <  i  <  2t  —  2g. 

Range:  yt  is  a  last  letter  whenever  t  +  1  <  i  <  2t. 

So  as  to  minimize  the  amount  of  double-counting,  we  index  each  of 
these  yi  s  that  appear  by  the  choice  of  g  that  forces  it  to  appear  last. 

Max  g  for  each  yf  L^y^J- 

3.  Note  that  in  this  case,  we  can  cover  all  cases  with  c  >  0  by  a  different 
case  with  c  =  0,  so  we  may  just  consider  the  cases  c  =  0  to  simplify 
things.  This  is  simply  because  if  c  >  0,  we  may  take  f'  =  f  —  c,c'  =  0 
to  obtain  the  same  f-prefix  with  smaller  choices  of  /,  b,  c.  We  use  the 
same  process  as  in  (2)  to  determine  the  possible  last  letters  and  index 
them  to  minimize  double-counting. 

(a)  For  0  <  /  <  *±i; 

[d]f  ©  yi . . .  yt-f 

[d]f  ©  yf...yt- 1 
Last  letters:  Vt-i , . . . ,  yt_  1. 

Range:  y *  is  a  last  letter  whenever  t  —  f  <  i  <t  —  1. 

Max  /  for  each  y%\ 

(b)  For  Wf  <  /  <  t  (recall  we  eliminated  f  —  t): 

[d]f  ©  yi...yt-f 

[d],f  ©  yt-f+i  ■  ■  ■  V2t-2f 
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Last  letters:  yi,y2,..  ■  ,Vt- 2- 

Range:  yt  is  a  last  letter  whenever  t  —  f<i<2t  —  2 /. 

Max  /  for  each  i: 

Note  that  because  we  require  n  >  2 1,  both  cases  (2)  and  (3)  cover  all 
possible  f-prefixes.  That  is,  we  cannot  possibly  have  any  f-prefixes  that  end 
in  [d]k  for  any  k  >  0.  Additionally,  note  that  each  case  covers  a  different 
range  of  last  letters:  (1)  i  —  t;  (2)  t  +  1  <  i  <  2 1;  and  (3)  t  —  f  <  i  <  t  —  1. 
Hence  we  may  count  each  case  separately. 


1.  There  is  only  one  string  in  this  case. 

2.  We  showed  previously  that  max(g)  =  Lht^J-  Thus  we  have  the  follow¬ 
ing  formula. 

(  d ^  +  2  •  22j=o  ^ ^  t  is  °dd; 

1  2  •  Ylj=< o  if  t  is  even. 

3.  In  this  case,  our  subcases  (a)  and  (b)  overlap.  We  break  up  our  ranges 
slightly  differently  this  time  to  determine  max(/). 


(a)  1  <  i  < 

In  this  range  for  i,  we  must  be  in  the  higher  range  for  /,  so  we 
have  rna x(/)  =  [^J. 

(b)  2. 

Considering  both  ranges  for  /,  we  have  the  following  maximum 
value  for  /,  depending  on  i. 


max(/) 


max 


ft  +  1 

2  t  —  i 

2  t  —  i 

V  2  ’ 

2 

)- 

2 

(c)  i  =  t  —  1. 

For  this  value  of  i,  we  must  be  in  the  lower  range  for  /,  and  hence 
we  have  max(/)  =  |_^J  =  L^irJ  • 

Hence  all  cases  (a)-(c)  have  max(/)  =  Thus  we  have  the  follow¬ 

ing  formula. 


2  •  YL\+i  d? ,  if  t  is  odd; 

^  2 

—d?  +  2  •  X)*=±4.1  if  t  is  even. 

3  2  '  1 
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Now  when  we  combine  all  of  our  equations  we  get  the  following  final 
count. 

t- 1 

1  -  d +  2  •  dj 

3=0 

Note  that  this  provides  only  an  upper  bound  on  our  t-prefixes  -  if  we  have 
repeated  letters  than  we  may  have  double-counted.  □ 


Now  we  are  ready  to  prove  our  theorem. 

Proof  of  Theorem \2. 11  Consider  two  arbitrary  strings:  x  =  X\X2  ■  ■  -xn  and 
V  =  2/12/2  •  •  • yn ■  We  will  show  that  these  two  strings  cannot  be  t-twins  by 
showing  that  Bt(x)\Bt(y)  ^  0.  This  will  be  done  in  two  cases:  X\X2  ■  ■  ■  xn-t  ^ 
yiy2  •  •  •  yn-t  and  xt+1xt+2  ■■■xn  ^  yt+Wt+2  ■■■yn-  Note  that  this  covers  all 
cases,  since  x  j -  y  implies  there  is  some  i  G  [1,  n]  such  that  xt  ^  y%.  Addi¬ 
tionally,  since  n  >  2 t,  we  must  have  that  i  G  [1  ,n  —  t\  U  [t  +  l,n].  Hence  at 
least  one  of  these  two  cases  must  be  true. 


1.  XyX2  •  •  •  Xn_t  7^  VlV2  ■  ■  ■  Vn-t- 

We  will  show  that  there  must  exist  some  string  in  Bt(x)  that  is  not  in 
Bt(y).  In  particular,  there  is  a  string  a  G  [df  ©  X\ . .  .xn_t  such  that 
a  ^  Bt(y).  We  do  this  by  counting  the  number  of  distinct  f-prefixes 
in  Bt(y)  \  [df  ©  2/12/2  •  •  • yn-t ,  and  showing  that  this  number  is  smaller 
than  df.  Note  that  because  of  the  case  that  we  are  in,  we  need  not 
consider  the  strings  in  [d]4  ©  yyy2  ■  ■  ■  yn-t ■  If  we  can  show  that  the 
number  of  t-prefixes  is  smaller  than  d t,  then  there  must  be  some  string 
z  e  Bt(x)\Bt(y). 

From  Lemma  12. 3[  we  know  that  the  total  number  of  f-prefixes  in  Bt(y)  \ 
[d)t(&yiy2  •  •  •  yn-t  is  equal  to  l  —  d^i  +2  •  Y^j= 0  $  1  and  that  one  of  those 
t-prefixes  is  y\  . . .  yt,  which  we  may  ignore  because  of  the  case  that  we 
are  in.  Define  fit)  =  —  +  2  •  ^  anc^  d(f)  =  dl  —  fit).  If  we  can 

show  that  g(t)  is  always  positive  for  d  >  3,  then  we  know  that  there 
exists  a  string  a  G  ([d]*  ©  xx . . .  xn_t)\([d]*  ©  yi . . .  yn-t)  Q  Bt(x)\Bt(y). 
Then  we  know  that  x  and  y  are  not  t-twins. 
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Consider  our  new  function  g(t). 


t- 1 

g(t)  =  d4  +  d4/2-2-^V 

d  —  1 

d4(d-l)  +  d4/2(d-l)-2(d4-l) 

d-  1 

We  will  determine  the  nature  of  this  function  by  finding  the  roots. 
We  find  the  roots  by  setting  the  numerator  equal  to  0  and  making  a 
substitution  x  =  d 4/42 . 

d4(d  —  1)  +  d4/,2(d  —  1)  —  2  (d4  —  1)  =  x2(d  —  3)  +  x(d  —  1)  +  2 

The  roots  of  this  equation  are  x  —  —  1  and  x  =  Reversing  our 

substitution  this  equates  to  d4//2  =  —1  and  d f//2  =  The  first  root 

is  impossible,  and  the  second  will  only  be  possible  when  2d  —  6  <  0, 
or  d  <  3.  Hence,  if  d  >  3,  our  function  has  no  real  roots  and  is  always 
positive. 


2-  ^t+l^t+2  .  .  .  Xn  7^  2/t-(-2  ■  ■  ■  Vn- 

In  this  case,  we  want  to  show  that  there  exists  some  string: 

a  G  (xt+i . . .  xn  ©  [d]4)  \  (yt+i  ...yn®  [d]4)  C  Bt(x)  \  Bt(y). 

Because  of  the  symmetric  nature  of  the  strings  and  edges  in  the  de 
Bruijn  graph,  this  case  follows  the  same  as  the  previous  case,  with 
analogous  lemmas  to  Lemmas  12.21  and  12.31  for  t-sufhxes  (instead  of  t- 
prefixes).  Thus  we  will  again  always  have  fewer  than  d4  prefixes  rep¬ 
resented  in  Bt(y)  \  (yt+ 1 . . .  yn  ©  [d]4),  so  we  will  always  be  able  to  find 
the  desired  string  a  that  can  identify  x  from  y. 


□ 


As  a  separate  result,  we  show  that  B(d,  3)  is  2-identifiable  for  d  >  3. 
Theorem  2.4.  B(d,  3)  is  2-identifiable  whenever  d  >  3. 


Proof.  Let  x  =  xxx2x3  and  y  =  2/12/22/3  be  distinct  vertices  in  £>(d,  3).  We 
consider  three  cases. 

Case  1  xi  7^  2/1- 

Let  axa2a3  G  [d]  ©  [d]  ©  £1  such  that  aia2  is  not  one  of  the  following 
strings  or  is  not  contained  in  one  of  the  sets  of  strings. 

[d]  ©  y1 
2/22/3 
2/3  ©  [d] 

2/12/2 
[d]  ©  y2 

We  have  a  total  of  [d]2  options  for  aia2,  and  this  list  contains  at  most 
3d  —  1  of  those  choices.  Hence  for  d  >  3,  there  is  always  an  option  left 
for  aia2.  Then  we  have  ai<22a3  G  B2(x)  \  B2(y). 

Case  2  x3  y3. 

Let  axa2a3  Gi3©  [d]  ©  [d]  such  that  a2a3  is  not  one  of  the  following 
strings  or  is  not  contained  in  one  of  the  sets  of  strings. 

2/12/2 
2/3  ©  [d] 

[d]  ©  2/1 
2/2  ©  [d] 

2/22/3 

Note  that  this  set  of  strings  has  at  most  3d  —  1  elements,  and  hence 
we  can  always  find  some  choice  for  a2a3  that  is  allowed.  Then  we  have 
axa2a3  G  B2(x)  \  B2(y). 

Case  3  x2  y2  and  xxx3  =  yxy3. 

We  break  this  case  up  further  into  three  subcases. 

1.  If  xxx2  =  2/22/3;  then  we  must  have  x  =  abb  and  y  =  aab  for  some 
a  b.  Then  ebb  G  B2(x)  \  B2(y )  for  any  choice  of  c  G  [d]  \  {a,  b}. 

2.  If  x\  =  x3,  then  x  =  aba  and  y  =  aca  for  some  6  ^  c  G  [d].  We 
must  have  either  6  /  a  or  c  /  a,  and  so  without  loss  of  generality 
we  may  assume  b  a.  Then  kab  G  B2(x)  \  B2(y )  for  any  choice 
of  k  G  [d]  \  {a,  c}. 
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3.  Lastly,  if  we  are  not  in  either  of  the  two  previous  subcases  then 
we  choose  aia2a3  G  x±x2  ©  [d]  with  a3  G  [d]  \  {y  1,2/2}-  Then  we 
have  dia2a3  G  B2{x )  \  B2(y). 


□ 

For  the  remaining  cases  where  n  <  2 1,  a  different  argument  must  be 
found.  While  this  problem  remains  open,  we  believe  that  the  following  result 
could  be  useful  in  solving  these  cases. 

Theorem  2.5.  For  any  y  G  B(d,n)  with  d  >  3,  there  exists  some  vertex  x 
such  that  d(y,x )  =  n. 

Proof.  We  proceed  by  induction  on  n  and  show  that  if  the  claim  is  true  in 
B(d,  n )  for  n  >  2,  then  the  claim  is  true  for  B(d,  n  +  2). 

Base  Case:  n  =  2.  Since  d  >  3,  our  vertex  y  =  y \ y2  can  use  at  most 
two  symbols  from  our  alphabet.  Suppose  that  z  G  [d]  \  {yi,y2}.  Then 
d{y,zz )  =  2. 

As  our  induction  proceeds  from  string  length  n  to  n  +  2,  we  require  an 
additional  base  case  of  n  =  3.  If  our  vertex  y  =  y\y2y3  only  uses  two  dis¬ 
tinct  symbols  from  [d],  then  the  string  x  =  an  where  a  G  [d]  \{yi,  y2,  y 3} 
satisfies  d(y,x)  =  3.  Otherwise,  we  must  have  [d]  =  {2/1, 2/25  2/3}-  Then 
the  vertex  x  =  ( y2 )3  satisfies  d(y,x)  =  3. 

Induction  Step:  Let  y  =  y0  ©  y  ©  yn+i  be  arbitrary.  By  the  induction 
hypothesis,  there  exists  some  x  G  B(d,n )  such  that  d(x,y)  =  n.  We 
will  show  that  d(y,x)  =  n  +  2,  where  x  =  x0  ©  x  ©  xn+i  with  x0  G 
[d]  \  {yn,  yn+ 1}  and  xn+1  G  [d]  \  {y0, 2/1}.  We  will  show  that  x  Bn+1(y) 
using  Lemma [2721  and  considering  each  type  of  path  and  resulting  string 
individually. 

1.  x  —  y .  Not  possible  since  x  j -  y. 

2.  FBF-type. 

First,  from  Lemma  12.21  we  know  that  since  d(x,y)  =  n  there 
cannot  exist  any  choice  of  /,  6,  g  such  that  f  +  b  +  g<n  —  l,b>0, 
b  >  /,  and  b  >  g  such  that 


x  G  [d]9  ©  2/6-/+1  •  •  •  yn-f  ©  [d]b  9 ■ 
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In  other  words,  we  must  have 

Db—f+ 1  •  •  •  Un-f  Xg+ 1  •  •  •  Xg+n-b 

for  all  such  choices  of  /,  b,  g. 

Now  we  will  show  that  there  does  not  exist  an  FBF-path  of  length 
n  +  1  or  less  between  x  and  y.  Fix  some  /,  b,  g  such  that  f  +  b+g  < 
n  +  1,  b  >  0,  b  >  /,  and  b  >  g.  From  Lemma  12.21  all  vertices 
zoZi . . .  zn+ 1  that  can  be  reached  by  an  FBF-path  with  parameters 
(/,  b,g)  from  y  must  have 

Vb—f  ■  ■  ■  Vn—f+l  Zg  ■  ■  ■  ^g+n+l—b- 

(a)  If  /  =  0,  b  =  k,  and  g  =  0,  then  we  consider  1  <  k  < 
7i—l  and  n  <  k  <  n  +  1  separately.  First,  if  1  <  k  < 
n  —  1,  then  our  induction  hypothesis  with  parameters  (0,  fc,  0) 
tells  us  that  X\ . . .  xn_k  7^  Vk+i  ■  ■  -Un  when  we  examine  FBF- 
paths  with  parameters  (0,  k ,  0)  from  y.  Hence  we  cannot  have 
xq  . . .  xn-k+i  =  yk  ■  ■  ■  2/n+i,  and  so  no  such  FBF-path  exists 
between  x  and  y.  Next,  if  n  <  k  <  n  +  1,  then  since  Xq  ^ 
ymVn+i,  we  will  never  have  x0Xi  =  ynyn+ i  or  x0  =  yn+1,  and 
so  again  no  such  FBF-path  exists  in  B(d,n  +  2). 

(b)  If  /  >  1,  then  we  must  have  b  >  2.  In  this  case,  in  or¬ 
der  for  such  an  FBF-path  to  exists  from  y  to  x  we  must 
have  xg  . . .  xg+n_b+ 1  =  y^-f  ■  ■  ■  yn+\-f.  However  our  induc¬ 
tion  hypothesis  with  parameters  (/  —  1,  b  —  1,  g)  tells  us  that 
xg+i  . . .  xg+n_b+1  ^  yb-f+ 1  •  •  •  Vn-f+i,  and  so  no  such  FBF- 
path  exists  in  B(d,  n  +  2). 

(c)  If  g  >  1,  then  again  we  must  have  b  >  2.  In  this  case,  in  order 
for  such  an  FBF-path  to  exist  we  must  have  xg  . . .  xg+n_b+i  = 
yb~f  ■  ■  ■  yn+i-f.  However  our  induction  hypothesis  with  pa¬ 
rameters  (/,  6—1,  g— 1)  tells  us  that  xg  . . .  xg+n-b  ^  yb-j  . . .  yn-f , 
and  so  no  such  FBF-path  exists  in  B(d,  n  +  2). 

Hence  we  cannot  have  an  FBF-path  of  length  less  than  n  +  2 
between  y  and  x  in  B(d,  n  +  2). 

3.  BFB-type. 

First,  from  Lemma  12.21  we  know  that  since  d(x,y )  =  n  there 
cannot  exist  any  choice  of  6,  /,  c  such  that  b+  f  +  c  <  n  —  1,  />0, 
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f  >  b,  and  /  >  c  such  that 

x  G  [d)f~c  0  yb+ 1 . . .  yn-f+b  ©  [d}c. 

In  other  words,  we  must  have 

Vb+ 1  •  •  •  y-n—f+b  7^  •£/— c+1  •  •  •  %n—c 
for  all  such  choices  of  5,  /,  c. 

Now  we  will  show  that  there  does  not  exist  a  BFB-path  of  length 
n+1  or  less  between  x  and  y.  Fix  some  b,  /,  c  such  that  b  +  f  +  c  < 
n+1,  /  >  0,  /  >  b,  and  /  >  c.  From  Lemma  12.21  all  vertices 
zoZi . . .  zn+ 1  that  can  be  reached  by  a  BFB  path  from  y  with  these 
parameters  must  have 


Vb  •  •  •  Vn+l—f+b  ^f—c  •  •  •  +i+l —c- 

(a)  If  b  =  0,  /  =  k,  and  c  =  0,  then  we  consider  1  <  k  <  n  —  1 
and  n  <  k  <  n  +  1  separately.  First,  ifl<k<n— 1,  then 
our  induction  hypothesis  tells  us  that  Xk+i  ■  ■  ■  xn  ^  y\ . . .  yn-k 
when  we  examine  BFB-paths  with  parameters  (0,  k,  0)  from  y. 
Hence  we  cannot  have  Xk  ■  ■  ■  xn+i  —  yo  ■  ■  ■  yn-k+i  in  B(d ,  n+ 2), 
so  no  such  BFB-path  exists  between  x  and  y. 

Next,  if  n  <  k  <  n  +  1,  then  since  xn+\  ^  2/o?  2/i?  we  will 
never  have  xnxn+i  =  y0yi  or  xn+i  =  y0,  and  so  again  no  such 
BFB-path  exists  in  B(d ,  n  +  2). 

(b)  If  b  >  1,  then  we  must  have  /  >  2.  In  this  case,  in  or¬ 
der  for  such  a  BFB-path  to  exist  from  x  to  y  we  must  have 
x f—c  . . .  xn+1_c  =  yb . . .  yn+i_f+b.  However  our  induction  hy¬ 
pothesis  with  parameters  {b  —  1,  /  —  1,  c)  tells  us  that 

%  f—c  ■  ■  ■  % n—c  7^  yb  ■  ■  ■  Vn—f+bi 

and  so  no  such  BFB-path  exists  in  B(d,  n  +  2). 

(c)  If  c  >  1,  then  again  we  must  have  /  >  2.  In  this  case,  in 
order  to  have  such  a  BFB-path  between  x  and  y  we  must 
have  x  f_c  . . .  xn+ i_c  =  yb ...  yn+i  -f+b-  However  our  induc¬ 
tion  hypothesis  with  parameters  (6,  /  —  1,  c  —  1)  tells  us  that 
xf-c+1 . .  .xn-c+i  ^  yb+1. .  .yn-f+1+b,  and  so  no  such  BFB- 
path  exists  in  B(d,  n  +  2). 
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Figure  1:  13(2,3)  does  not  contain  any  vertices  at  distance  3  from  Oil. 


Hence  we  cannot  have  a  BFB-path  of  length  less  than  n+2  between 
y  and  x  in  B(d,  n  +  2). 

Therefore  there  is  no  path  from  y  to  x  of  length  n  + 1  or  smaller,  and  so 
d(y,  x)  >  n  +  2.  As  it  is  well  known  that  the  de  Bruijn  graph  B(d,  n  +  2) 
has  diameter  n  +  2  (see  IB),  we  must  have  d(y,  x)  =  n  +  2. 


□ 

In  other  words,  Theorem  12. 51  tells  us  the  eccentricity  of  every  node  in  the 
graph  B(d,n )  is  n  for  d  >  3,  and  so  the  radius  of  B(d,n )  is  n.  Note  that 
when  d  =  2  this  does  not  always  hold.  For  example,  the  graph  B( 2,  3)  does 
not  have  any  vertex  at  distance  3  from  Oil.  See  Figure  [D 

3  Binary  de  Bruijn  Graphs 

We  now  consider  the  binary  de  Bruijn  graphs.  We  provide  one  result  within 
this  range,  and  show  that  B(2,n )  is  always  1- identifiable. 

Theorem  3.1.  For  n  >  3,  the  graph  B(2,n)  is  identifiable. 

Proof.  For  n  =  3,  the  following  is  a  minimum  1- identifying  code  on  B( 2,  3). 

{001,010,011,101} 

When  n  >  4,  we  have  the  following  proof,  with  many  cases.  We  will  prove 
this  result  by  showing  that  it  is  not  possible  to  have  two  vertices  x  and  y 
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that  are  twins.  Suppose  (for  a  contradiction)  that  x  and  y  are  in  fact  twins 
in  13(2,  n).  First,  the  1-balls  for  each  vertex  are  as  follows. 


BAx)  =  l 


XiX2 

•  •  %n 

r  ym 

■  ■  ■  yn 

Oaq . 

.  Xn—\ 

02/1  • 

■ ■ yn— 1 

lxi . 

.  Xn—\ 

>  B^y)  =  < 

lyi . 

• • Vn—1 

x2 .  . 

xnQ 

Ih  ■  • 

■  yn  0 

•.  x2 

^  y 2.. 

•2/nl  y 

Without  loss  of  generality,  we  assume  that  X\  =  0.  Then  we  have  two 
cases:  either  xxx2  ...xn  =  0y1...  yn- 1,  or  x{x2  ...xne{y2...  yn  0,  y2  . . .  yn  1}. 

1.  xix2  ...xn  =  Qyi .  ,.yn- 1- 

In  this  case,  we  know  that  0x2  . . .  xn  —  Or/i . . .  yn-\,  and  so  x2  . . .  xn  = 
yi . .  .yn- 1-  From  this,  we  know  the  following  equality  holds. 

{x2...  xno,  x2...  xni}  =  {2/12/2  ■■■yn,  ym  ■  ■  ■  K} 

This  gives  us  two  cases:  either  ym  . . .  e  {02/! . . .  yn- 1, 12/1  -  -  -  Vn- 1}, 
or  ym  ...y^e{y2...  yn  0, 1/2  ••  •  yni}- 

(a)  yrn  --me  {Oyi . . .  yn-i,  lyi . . .  yn- 1} 

The  fact  that  y2  .  ■  -  yH  —  yi  ■  ■  ■  yn- 1  implies  the  following. 

2/i  =  2/2  =  •  •  •  =  yn- 1  =  1/^ 

Because  we  are  in  Case  1  and  x2  . . .  xn  =  y\ . . .  yn-\,  we  also  have 
the  following  equalities. 

x2  =  x3  =  ■  ■  ■  =  xn  =  yi 

Hence  our  1-balls  must  be  as  shown  below  for  some  a  G  {0, 1}. 


B  Ax)  =  { 


'  0 a. 

.  a 

a . . 

.  aa 

00a 

. .  a 

0a . 

. .  a 

10a 

. .  a 

>  B^y)  =  < 

la . 

. .  a 

a . . 

aO 

a . . 

.  aaO 

a . . 

al 

> 

a . . 

.  aal 

Note  that  since  n  >  4,  we  have  two  strings  in  B\(y)  that  have 
different  second-to-last  and  third-to-last  letters,  however  in  B\(x) 
there  are  no  such  strings.  Hence  these  sets  cannot  possibly  be 
equal,  which  is  a  contradiction. 
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(b)  ym  .■■yne{y2...  yn .0,  y2  ■  ■  ■  yn  1} 

This  implies  that  y\y2  ■  ■  ■  yn- 1  =  IJ2  ■  ■  ■  Um  and  so  we  have  the 
following  chain  of  equalities. 


y  l  =  V2  =  ■■■  =  1-Jn—  1  =  IJn 

Hence  y  =  an  and  x  =  Oa’^1  for  some  a  G  {0, 1}.  Since  x  ^  y,  we 
must  have  a  —  1  and  thus  our  1-balls,  given  below,  are  clearly  not 
equal  -  a  contradiction. 


B^x) 


'  01... 1  ' 
001... 1 
<  101... 1  > 
1...10 
1 ...  11 


Bi{v) 


11...  1 
01...  1 
1...10 


2.  X1X2  . . .  xn  G  {y2  •  •  •  yn0,  y2  ■  ■  ■  yn  1}  and  y2  =  0. 
From  this,  we  have  the  following  1-balls. 


'  0*2  . 

■  Xn 

J/i0a:2. 

.  . 

00a;2 

.  .  Xn—\ 

0yi0x2 

.  .  .  Xn—2 

10x2 

.  .  Xn—\ 

>  £1(3/)  =  < 

iyi0x2 

.  .  .  Xn_2 

x2 . . 

Xn  0 

0x2. . . 

xn_i0 

,  x2-. 

xni 

.  0x2... 

®n-ll 

Now  we  have  two  cases:  either  ly10x2  ■  ■  ■  xn_2  =  10x2  . . .  xn_i,  or 
lyi0x2  •  •  •  zn-2  G  {x2  . . .  xn0,  x2  . . .  xnl}. 


(a)  lyi0x2  . . .  xn-2  =  10x2  . . .  xn-i. 

This  statement  implies  that  we  have  the  following  chain  of  equal¬ 
ities. 

1/3  '  yn  2  '  *£n— 1 

In  particular,  we  now  know  that  x  =  0 a ...  a  and  y  =  00a ...  a. 
Hence  our  1-balls  are  given  below. 


'  0a. 

.  a 

'  00a. 

.  a 

00a 

. .  a 

000a 

. . .  a 

10a 

. .  a 

>  5x(j/)  =  < 

100a 

. . .  a 

a . . 

aO 

0a . . 

aO 

a . . 

al 

> 

0a . . 

al 
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Since  000a ...  a  G  Bi(y),  the  only  way  to  have  B  i(x)  =  Bi(y) 
would  require  a  =  0,  and  thus  x  —  y,  which  is  a  contradiction. 

(b)  1t/i0x2  .  •  •  xn-2  G  {x2  ■  ■  ■  x„0,  x2. . .  xnl}  and  x2  =  1. 

In  this  instance,  we  know  that  x2...xn  =  lyi0x2 . . .  xns,  and 
hence  x$  . . .  xn  —  x2  . . .  xn -3.  This  tells  us  that  x  =  Olj/iOl^/i . . . 
and  y  =  7/1OI7/1OI . . ..  In  particular,  our  1-balls  are  now  shown 
below. 


r  OlyiOlyi ...  ' 

'  yiOlyiOl ...  ' 

OOlyMyi . . . 

QyiOlyiOl . . . 

IOH/1OI  yi... 

>  Bi(y)  =  < 

ly^lyM ... 

lyiOlyi ...  0 

OI2/1OI ...  0 

,  lyiOlyi...!  J 

,01^01...!  / 

Note  that  B\(y)  contains  two  distinct  strings  beginning  with  01, 
while  Bi(x)  contains  only  one  such  string.  Hence  it  is  not  possible 
that  Bi(x)  =  B\ (y),  which  contradicts  our  initial  assumption. 


□ 

Due  to  the  fact  that  the  eccentricity  in  the  binary  de  Bruijn  graph  B( 2,  n) 
is  not  always  equal  to  n,  we  know  that  there  will  be  cases  when  a  f-identifying 
code  does  not  exist. 


4  Future  Work 

We  have  the  following  questions  to  consider. 

1.  Is  there  a  pattern  for  when  B( 2,  n)  is  f-identihable?  A  related  question 
is  to  determine  the  eccentricity  for  the  undirected  binary  de  Bruijn 
graph. 

2.  Can  we  determine  when  B(d,n )  is  /-identifiable  for  d  >  3  and  n  <  2 f? 
Computer  testing  has  led  us  to  conjecture  that  for  d  >  3  and  n  >  2, 
there  exists  a  /-identifying  code  in  B(d,  n)  for  1  <  /  <  n  —  1. 

3.  What  is  the  minimum  possible  size  for  an  identifying  code  in  B(d,  n)7 
Are  there  any  efficient  constructions  for  either  optimal  or  non-optimal 
identifying  codes  in  these  graphs? 
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